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Abstract — This  paper  addresses  the  problem  of  target  detec¬ 
tion  and  classification,  where  the  performance  is  often  limited 
due  to  high  rates  of  false  alarm  and  classification  error,  possibly 
because  of  inadequacies  in  the  underlying  algorithms  of  feature 
extraction  from  sensory  data  and  subsequent  pattern  classifica¬ 
tion.  In  this  paper,  a  recently  reported  feature  extraction  algo¬ 
rithm,  symbolic  dynamic  filtering  (SDF),  is  investigated  for  target 
detection  and  classification  by  using  unmanned  ground  sensors 
(UGS).  In  SDF,  sensor  time  series  data  are  first  symbolized 
to  construct  probabilistic  finite  state  automata  (PFSA)  that,  in 
turn,  generate  low-dimensional  feature  vectors.  In  this  paper,  the 
performance  of  SDF  is  compared  with  that  of  two  commonly  used 
feature  extractors,  namely  Cepstrum  and  principal  component 
analysis  (PCA),  for  target  detection  and  classification.  Three 
different  pattern  classifiers  have  been  employed  to  compare  the 
performance  of  the  three  feature  extractors  for  target  detection 
and  human/animal  classification  by  UGS  systems  based  on  two 
sets  of  field  data  that  consist  of  passive  infrared  (PIR)  and  seismic 
sensors.  The  results  show  consistently  superior  performance  of 
SDF-based  feature  extraction  over  Cepstrum-based  and  PCA- 
based  feature  extraction  in  terms  of  successful  detection,  false 
alarm,  and  misclassification  rates. 


I.  INTRODUCTION 

Several  feature  extraction  methods  have  been  proposed 
to  generate  patterns  from  time  series  data  for  classification 
purposes.  The  well-known  kurtosis  method  [1]  provides  a  sta¬ 
tistical  measure  of  the  amplitude  of  the  time  series.  In  another 
method  [2],  a  feature  vector  is  constructed  using  the  spectrum, 
where  the  power  spectral  density  and  the  wavelet  coefficients 
are  used  along  with  principal  component  analysis  for  feature 
extraction;  similar  time-frequency  domain  methods  have  been 
proposed  by  other  researchers  [3],  [4].  Recently,  feature  ex¬ 
traction  from  time  series  of  robot  motion  behavior  has  been 
proposed  by  Mallapragada  et  al.  [5],  based  on  symbolic  dy¬ 
namic  filtering  (SDF)  [6],  [7],  where  the  time  series  data  were 
preprocessed  by  Hilbert  transform  that  requires  conversion  of 
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the  real-valued  signal  into  complex-valued  analytic  signal  to 
extract  the  phase  information  [8].  In  the  SDF-based  feature 
extraction,  the  time  series  data  are  first  converted  into  symbol 
sequences,  and  then  probabilistic  finite-state  automata  (PFSA) 
are  constructed  from  these  symbol  sequences  to  compress  the 
pertinent  information  into  low-dimensional  statistical  patterns. 
More  recently,  SDF-based  feature  extraction  from  (wavelet- 
transformed)  time  series  has  been  proposed  by  Jin  et  al.  [9] 
for  target  detection  and  classification  in  border  regions.  The 
rationale  for  using  wavelet-based  methods  is  time-frequency 
localization  and  denoising  of  the  underlying  sensor  time  series. 
However,  this  method  requires  selection  and  tuning  of  several 
parameters  (e.g.,  wavelet  basis  function  and  scales)  for  signal 
pre-processing  in  addition  to  the  size  of  the  symbol  alphabet 
that  is  needed  for  SDF. 

Feature  extraction  based  on  the  concept  of  Cepstrum  [10] 
has  been  reported  as  a  conceptually  simple  and  computa¬ 
tionally  efficient  tool  for  use  in  pattern  classification  [11]. 
The  objective  of  the  current  paper  is  to  make  a  comparative 
evaluation  of  Cepstrum-based,  PCA-based,  and  SDF-based 
feature  extraction  for  target  detection  and  classification  in  the 
border  regions  using  unattended  ground  sensor  (UGS)  systems. 
However,  unlike  the  previous  work  of  Jin  et  al.  [9]  and 
Mallapragada  et  al.  [5],  this  paper  has  used  direct  partitioning 
and  symbolization  on  time  series  data  for  feature  extraction 
without  performing  the  wavelet  transform  or  Hilbert  transform 
for  signal  pre-processing.  The  rationale  is  to  make  SDF  com¬ 
parable  to  Cepstrum  and  PCA  for  feature  extraction,  which  do 
not  usually  preprocess  the  time  series.  In  this  case  (i.e.,  without 
wavelet-transform  or  Hilbert-transform  pre-processing  of  the 
signal),  the  SDF-based  feature  extraction  method  has  only  a 
single  parameter  to  tune,  which  is  the  (finite)  cardinality  of  the 
symbol  alphabet;  this  is  computationally  more  efficient  than 
if  wavelet  or  Hilbert  transformation  is  used.  For  comparison 
of  the  Cepstrum-based,  PCA-based,  and  SDF-based  feature 
extraction,  the  performance  of  the  respective  feature  extractors 
has  been  evaluated  on  the  same  sets  of  field  data  in  conjunction 
with  the  sparse  representation  (e.g.,  t\ -optimization)  [12], 
support  vector  machines  (SVM)  [13],  and  fc-nearest  neighbor 
(k-NN)  [13]  as  the  pattern  classifiers.  These  three  pattern 
classifiers  have  been  employed  to  compare  the  performance  of 
the  feature  extractors  for  target  detection  and  human/animal 
classification  by  unattended  ground  sensor  (UGS)  systems 
based  on  two  sets  of  field  data  that  consist  of  passive  infrared 
(PIR)  and  seismic  sensors  as  explained  below. 

Tools  of  target  detection  and  classification  by  unattended 
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ground  sensor  (UGS)  systems  have  been  extensively  used  to 
monitor  human  activities  for  border  security;  typical  examples 
of  UGS  are  seismic  and  Passive  Infrared  (PIR)  sensors. 
Seismic  sensors  are  suitable  for  long-range  target  detection, 
because  they  are  relatively  less  dependent  on  exogenous 
disturbances  (e.g.,  Doppler  effects),  as  compared  to  acoustic 
sensors  [2]  that  are  prone  to  contamination  by  environmental 
noise.  In  contrast,  PIR  sensors  are  well-suited  for  motion 
detection  and  has  the  advantage  of  low  power  consump¬ 
tion  [14].  Nevertheless  discriminating  human  footstep  signals 
from  other  target  types  and  noise  sources  is  a  challenging 
problem,  because  of  the  rapidly  decreasing  trend  of  signal  to 
noise  ratio  (SNR)  as  the  distance  between  the  sensor  and  the 
moving  target  increases.  Furthermore,  the  footstep  signals  have 
dissimilar  signatures  for  different  environments  and  persons, 
which  make  the  problem  of  target  detection  and  classification 
even  more  challenging  [15]. 

In  contrast  to  the  Cepstrum-based  and  PCA-based  feature 
extraction,  the  advantage  of  the  SDF-based  algorithm  is  that 
it  takes  into  account  the  local  information  of  the  signal  and  it 
is  capable  of  mitigating  noise  in  the  data  even  if  the  signal 
is  not  pre-processed  for  denoising.  It  is  shown  in  a  later 
section  of  the  paper  that  the  SDF-based  feature  extraction 
yields  superior  successful  detection,  false  alarm,  and  overall 
correct  classification  rates  compared  to  Cepstrum-based  and 
PCA-based  feature  extraction. 

This  paper  is  organized  into  five  main  sections  including 
the  present  section.  Section  II  briefly  describes  the  algorithms 
of  Cepstrum-based,  PCA-based,  and  SDF-based  feature  ex¬ 
traction.  Section  III  succinctly  discusses  the  the  three  pattern 
classification  algorithms  that  have  been  used  in  this  paper. 
Section  IV  presents  the  results  of  target  detection  and  classifi¬ 
cation  based  on  two  sets  of  field  data  of  passive  infrared  (PIR) 
and  seismic  sensors.  Section  V  summarizes  and  concludes  this 
paper  with  recommendations  for  future  research. 

II.  Feature  Extractors 

Feature  extraction  from  sensor  signals  is  an  important  step 
in  target  detection  and  classification  which  is  accomplished 
in  this  paper  by  three  alternative  algorithms,  namely  Cep- 
strum  [11],  PC  A  [13],  and  SDF  [6].  While  the  details  of 
these  algorithms  have  been  reported  in  earlier  publications, 
this  section  briefly  reviews  the  underlying  concepts  of  feature 
extraction  from  sensor  time  series  for  completeness  of  this 
paper  and  lists  the  algorithms  that  have  been  used  in  this  paper 
for  feature  extraction. 


A.  Cepstrum  for  Feature  Extraction 

This  subsection  briefly  describes  the  Cepstrum-based  fea¬ 
ture  extraction  that  has  been  widely  used  in  speech  recog¬ 
nition  and  acoustic  signal  classification  [10],  Given  a  signal 
/(/),  t  =  1  Cepstra  are  usually  computed  in  the 

following  form  [16],  which  is  used  in  the  rceps  Matlab  func¬ 
tion  (http://www.mathworks.com/help/signal/ref/rceps.html): 

fc(t)  =  5ft  (J7-1  (log  |.F(w)|))  (1) 


where  F(uj)  is  the  Fourier  transform  of  the  signal  /(f);  the 
operator  F~1  is  the  inverse  Fourier  transform;  and  R(z) 
indicates  the  real  part  of  a  complex  scalar  z. 

A  slightly  modified  version  of  this  algorithm  is  used  here. 
After  obtaining  the  Fourier  transform  of  the  signal,  frequency 
components  with  small  values  are  discarded  before  taking  the 
inverse  Fourier  transform  to  prevent  high  unimportant  compo¬ 
nents  from  gaining  higher  Cepstrum  values.  After  finding  the 
Cepstrum  features,  the  first  Nc  components  are  used  as  Cep¬ 
strum  features  for  classification.  Algorithm  1  for  Cepstrum- 
based  feature  extraction  is  presented  below. 


Algorithm  1  Cepstrum  for  feature  extraction 

Input:  Time  series  data  sets  x  £  R 1  x  ,  Cut-off  sample  Nj  (where  Nj  < 
N):  and  dimension  of  the  Cepstrum  feature  Nc  (where  Nc  <  Nt). 
Output:  Extracted  Cepstrum-based  feature  p  £  Rlx-™=  of  the  time-series  x 
1:  Compute  the  magnitude  of  FFT  |.F(u;)|  of  the  given  time  series  where 
u  =  1, . . . ,  N 

2:  Store  the  first  Nf  frequency  components  and  discard  the  rest 
3:  Compute  fc(t)  =  5ft  (T--1  (log  |F(tu)|))  ,  t  =  1 
4:  Compute  the  feature  p  =  [ fc ( I )  /c(  1)  . . .  fc(Nc)] 


B.  Principal  Component  Analysis  for  Feature  Extraction 

This  subsection  briefly  describes  the  principal  component 
analysis  (PCA)  for  feature  extraction,  which  has  been  widely 
used  in  diverse  applications  [13].  While  the  Cepstrum-based 
feature  extraction  makes  use  of  only  the  information  imbedded 
in  a  time  series,  PCA  takes  advantage  of  the  information  of  the 
ensemble  of  training  data  in  addition  to  the  local  information 
to  extract  the  features  [13].  Let  the  training  data  sets  (i.e., 
the  ensemble  of  time  series)  be  organized  as  an  (M  x  N)- 
dimensional  data  matrix,  where  M  is  the  number  of  training 
data  sets  and  N  is  the  length  of  each  data  set  (i.e.,  1-D  time 
series).  For  the  detection  problem,  the  training  data  consists 
of  both  “No  Target”  and  “Target  Present”  classes  while,  in  the 
human/animal  classification  problem,  the  training  data  consists 
only  of  samples  from  the  “Target  Present”  class. 

Let  X  be  the  centered  version  of  the  original  (M  x  N)  data 
matrix,  where  each  row  of  X  is  an  individual  data  sample 
x  with  zero  mean.  In  the  application  of  target  detection  and 
classification,  M  is  often  smaller  than  N,  i.e.,  the  number 
of  training  samples  are  fewer  than  the  length  of  time-series. 
Therefore,  it  is  numerically  efficient  to  analyze  the  (M  x  M ) 
matrix  (1  /M)XXT  that  has  the  same  nonzero  eigenvalues  as 
the  ( N  x  N)  computed  covariance  matrix  (1  /M)XT X  [13]. 

Let  Vi  be  the  normalized  eigenvectors  of  the  real  symmet¬ 
ric  matrix  (1  /M)XXT  corresponding  to  the  (real  positive) 
eigenvalues  A i  that  are  arranged  in  the  decreasing  order  of 
magnitude,  i.e.,  Ai  >  A2  >  •  •  •  >  Am-  Let  m  be  the  smallest 
integer  such  that  YhL  1  Y  >  V  Yh%L  \  Y  where  1  <  m  <  M, 
where  the  threshold  parameter  p  is  a  real  positive  fraction 
close  to  one.  The  corresponding  (normalized)  eigenvectors  Ui 
in  the  original  data  space  are  obtained  in  terms  of  and  A, 
as  follows  [13]  [5]: 


Ui  = 


XTVi,  i  =  1, 2, . . . ,  m. 


(2) 


2 


The  PCA  projection  matrix  W  €  E  x m  is  then  obtained  by 
grouping  the  computed  eigenvectors  as  follows: 

W  =  [Milt2  .  .  .  Um\  (3) 

The  PCA-based  feature  vector  p£  Klxm  for  a  given  (train  or 
test)  time-series  x  £  RlxAr,  is  then  computed  by  projection 
as  follows: 

p  =  xW.  (4) 

Algorithm  2  for  PCA-based  feature  extraction  is  presented 
below. 


Algorithm  2  Principal  component  analysis  for  feature  extraction 

Input:  Training  time  series  data  sets  Xj  £  Rlx  ,  j  =  1, . . . ,  M;  Tolerance 
r]  £  (0, 1);  and  test  time  series  data  set  x  £  RlxJV 
Output:  Extracted  feature  vector  p  E  Rlxm  for  the  time-series  x 
1:  Construct  the  “centered  version”  training  data  matrix  X  E  M,MxN ,  where 
each  row  Xj  has  zero  mean. 

2:  Compute  the  matrix  S  =  (1/M)XXT 

3:  Compute  the  normalized  eigenvectors  {v^}  of  S  with  their  corresponding 
eigenvalues  {A*}  in  the  decreasing  order  of  magnitude 
4:  Compute  the  normalized  eigenvectors  Uj  =  , *  1 


=  (X)TVi 


Find  the  smallest  m  <  M  such  that  fifff—  j  A i  >  r/  i  \i 
Construct  (IV  X  m)  projection  matrix  W  =  [ui,  U2, um] 
Generate  ( 1  X  m)  reference  patterns  p  =  rW 


C.  Symbolic  Dynamic  Filtering  for  Feature  Extraction 

This  subsection  succinctly  presents  the  underlying  concept 
and  theory  of  symbolic  dynamic  filtering  (SDF)  [6]  for  extrac¬ 
tion  of  (low-dimensional)  features  from  time-series  data.  As 
stated  earlier  in  Section  I,  the  SDF  algorithm  reported  in  [6], 
[7]  includes  wavelet-transformed  pre-processing  of  signals  to 
facilitate  time-frequency  localization  and  denoising.  However, 
inclusion  of  wavelet  transform  requires  tuning  of  additional 
design  parameters,  such  as  the  wavelet  basis  and  a  set  of 
scales.  Since  multiplicity  of  selectable  parameters  decreases 
the  ease  of  usage  and  may  affect  the  performance  of  the 
algorithms,  this  paper  has  applied  SDF  directly  on  the  time- 
series  data  (i.e.,  without  wavelet  preprocessing)  to  make  a  fair 
comparison  of  the  three  feature  extraction  methods  based  on 
the  same  sets  of  data. 

The  pertinent  steps  of  the  SDF  procedure  for  feature  extrac¬ 
tion  are  delineated  below. 

1 )  Symbolization  of  Time  Series:  This  step  requires  parti¬ 
tioning  (also  known  as  quantization)  of  the  time  series  data. 
The  signal  space,  approximately  represented  by  the  training 
data  set,  is  partitioned  into  a  finite  number  of  cells  that  are 
labeled  as  symbols,  i.e.,  the  number  of  cells  is  identically 
equal  to  the  cardinality  |£|  of  the  (symbol)  alphabet  £.  As 
an  example  for  the  one-dimensional  sensor  time  series  data 
in  Fig.  1,  the  alphabet  £  =  {a,/3, 7,<5},  i.e.,  |£|  =  4,  and 
three  partitioning  lines  divide  the  ordinate  (i.e.,  y-axis)  of  the 
time  series  profile  into  four  mutually  exclusive  and  exhaustive 
regions.  These  disjoint  regions  form  a  partition,  where  each 
region  is  labeled  with  one  symbol  from  the  alphabet  £.  If  the 
value  of  time  series  at  a  given  instant  is  located  in  a  particular 
cell,  then  it  is  coded  with  the  symbol  associated  with  that 
cell.  As  such,  a  symbol  from  the  alphabet  £  is  assigned  to 
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Fig.  1.  Concept  of  symbolic  dynamic  filtering  (SDF)  as  a  feature  extractor 


each  (signal)  value  corresponding  to  the  cell  where  it  belongs. 
(Details  are  reported  in  [7].)  Thus,  a  (finite)  array  of  symbols, 
called  a  symbol  string  (or  symbol  block),  is  generated  from 
the  (finite-length)  time  series  data. 

The  ensemble  of  time  series  data  are  partitioned  by  using  a 
partitioning  tool  (e.g.,  maximum  entropy  partitioning  (MEP)  or 
uniform  partitioning  (UP)  methods  [7]).  In  UP,  the  partitioning 
lines  are  separated  by  equal-sized  cells.  On  the  other  hand, 
MEP  maximizes  the  entropy  of  the  generated  symbols  and 
therefore,  the  information-rich  cells  of  a  data  set  are  parti¬ 
tioned  finer  and  those  with  sparse  information  are  partitioned 
coarser,  i.e.,  each  cell  contains  (approximately)  equal  number 
of  data  points  under  MEP.  In  both  UP  and  MEP,  the  choice  of 
alphabet  size  |£|  largely  depends  on  the  specific  data  set  and 
the  allowable  error  of  detection  and  classification. 

In  this  paper,  MEP  has  been  adopted  as  the  partitioning 
tool  to  accommodate  sparsity  of  the  time  series  data  to  be 
symbolized.  For  the  purpose  of  pattern  classification,  the 
training  data  set  is  partitioned  with  a  given  alphabet  size 
|  £  |  that  is  selected  by  trade-off  between  information  loss 
and  computational  complexity  [7]  and  the  partitioning  is 
subsequently  kept  constant  for  the  test  data. 

2)  Construction  of  probabilistic  finite  state  automata 
(PFSA):  The  core  assumption  for  construction  of  probabilistic 
finite  state  automata  (PFSA)  is  that  the  symbolic  process  for 
different  classes  of  targets  can  be  approximated  as  a  Markov 
chain  of  order  D,  called  the  D-Markov  machine,  where  D  is 
a  positive  integer.  While  the  details  of  the  D-Markov  machine 
are  given  in  [6],  the  pertinent  information  on  the  construction 
of  a  D-Markov  machine  is  presented  below. 

A  D-Markov  chain  is  a  statistically  (quasi-)stationary 
stochastic  process  S  =  ■  ■  ■  s_iSo  ■  ■  •  si  •  •  • ,  where  the  prob¬ 
ability  of  occurrence  of  a  new  symbol  depends  only  on  the 
last  D  symbols,  i.e., 

P\Sn  |  Sn—  1  *  *  *  Sn—D  •  •  *  ]  —  P[sn  |  Sn—  1  •  •  •  Sn  — £>] 

The  construction  of  a  D-Markov  machine  is  based  on:  (i)  state 
splitting  that  generates  symbol  blocks,  also  called  words,  of 
different  lengths  according  to  their  relative  importance;  and 
(ii)  state  merging  that  assimilates  histories  from  symbol  blocks 
leading  to  the  same  symbolic  behavior  [17],  Words  of  length 
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D  on  a  symbol  string  are  treated  as  the  states  of  the  D- 
Markov  machine  before  any  state-merging  is  executed.  Thus, 
on  a  (finite)  alphabet  with  cardinality  |£|,  the  total  number  of 
possible  states  is  less  than  or  equal  to  |£|D;  and  operations  of 
state  merging  may  significantly  reduce  the  number  of  states. 
As  the  value  of  D  is  increased,  more  memory  is  imbedded 
in  the  Markov  states  of  the  PFSA.  However,  the  benefits 
of  having  additional  memory  associated  with  a  larger  value 
of  D  could  be  offset  by  the  increased  computational  load. 
So,  one  must  make  a  trade-off  between  the  two  competing 
requirements  of: 

•  Capturing  information  from  the  time  series,  and 

•  Reduction  of  computational  complexity  in  terms  of  mem¬ 
ory  requirements  and  execution  time  in  the  construction 
of  D-Markov  machine  algorithms. 

The  PFSA  states  represent  different  combinations  of  words 
on  the  symbol  sequence.  In  the  graph  of  a  PFSA,  the  direc¬ 
tional  edge  that  interconnects  a  state  (i.e.  a  node)  to  another 
state  represents  the  transition  probability  between  these  states. 
Therefore,  the  “states”  denote  all  possible  words  within  a 
window  of  certain  length.  In  this  paper,  the  word  size  D  is 
taken  to  be  1  and  hence  the  number  of  possible  states  is  equal 
to  the  number  of  symbols;  therefore,  the  results  reported  in  this 
paper  are  limited  to  D  =  1.  However,  in  general,  larger  values 
of  the  integer  parameter  D  (i.e.,  D  >  1)  might  be  necessary  to 
capture  the  long  time-scale  information  about  the  process.  It 
is  noted  that  the  operations  of  state  splitting  and  state  merging 
might  be  necessary  for  PFSA  construction  with  D  >  1,  while 
these  operations  are  not  required  for  D  =  1  [17]. 

As  a  consequence  of  having  D  =  1,  the  number  of  states  is 
equal  to  the  number  of  symbols,  i.e.,  |Q|  =  |£|,  where  the  set 
of  all  possible  states  is  denoted  as  Q  =  {gi,  <72,  •  ■  • ,  Q|q|}  and 
|Q|  is  the  number  of  (finitely  many)  states.  The  (estimated) 
state  transition  probabilities  are  defined  as: 


p{qk  |  qi)  = 


N(qi,qk) 


£i=i,2,...,|Q|  N  (qi,qi) 


Vqklqi&Q  (5) 


where  N  ( qi,qk )  is  the  total  count  of  events  when  qk  occurs 
adjacent  to  qi  in  the  direction  of  motion.  Having  computed  all 
these  probabilities  p  (qk  \  qi)\/qk,qi  £  Q,  the  (estimated)  state 
transition  probability  matrix  of  the  PFSA  is  given  as 


n  = 


p(qi  I  qi) 
p{q  1 1  q\Q\) 


p{q\Q\  I «) 

p(m  I  9iei)  . 


(6) 


By  appropriate  choice  of  partitioning,  it  is  ensured  that  the 
resulting  Markov  chain  model  satisfies  the  ergodicity  condi¬ 
tions,  i.e.,  the  stochastic  matrix  n  is  irreducible.  The  rationale 
is  that,  under  statistically  stationary  conditions,  the  probability 
of  every  state  being  reachable  from  any  other  state  within 
finitely  many  transitions  must  be  strictly  positive  [18].  For  a 
given  time  series,  after  the  matrix  n  is  constructed,  its  left 
eigenvector  p  corresponding  to  the  (unique)  unity  eigenvalue 
is  computed.  Then,  the  vector  p,  which  is  the  stationary  state 
probability  vector,  serves  as  the  “feature”  vector  extracted 
from  the  time  series  as  seen  in  Fig.  1.  This  feature  vector 
p  is  used  for  pattern  classification  in  the  sequel. 


3)  Formulation  of  SDF -based  feature  extraction  algorithms: 
The  SDF-based  feature  extraction  is  executed  as  a  combination 
of  two  algorithms: 

•  Maximum  entropy  partitioning  (MEP)  of  time  series  data. 

•  Symbolic  dynamic  filtering  (SDF)  for  feature  extraction, 
which  makes  use  of  the  MEP  algorithm. 

Algorithm  3  and  Algorithm  4  for  SDF-based  feature  extrac¬ 
tion  are  presented  below. 


Algorithm  3  Maximum  entropy  partitioning 

Input:  Finite-length  string  y  of  time-series  data;  and  number  of  symbols  |£| 
Output:  Partition  vector  p  £  RIeI+1 
1:  Sort  the  time  series  data  string  x  in  the  ascending  order 
2:  Let  K=\ength(y) 

3:  Assign  p(l)  =  y ( 1 ) .  i.e.,  minimum  element  of  y 
4:  for  i= 2  to  |£|  do 

5:  p(i)  =  y  ^ ceil 

6:  end  for 

7:  Assign  p(|£|  +  1)  =  y(K),  i.e.,  maximum  element  of  y 


Algorithm  4  Symbolic  dynamic  filtering  for  feature  extraction 

Input:  Training  time  series  data  sets  Xj  E  Mlx  =  1, . . . ,  M,  a  test  time 
series  data  set  x  E  M.lxN ,  and  number  of  symbols  |£| 

Output:  Extracted  SDF-based  feature  vector p  £  Rlxlsl  for  the  time-series 
x 

1 :  Initialize  y  =  0 

2:  for  j  =  1  to  M  do 

3:  y  =  y\Jxi 

4:  end  for 

5:  Partition  y  using  Algorithm  3  to  obtain  the  (common)  partition  vector  p 

6:  Use  p  on  the  test  data  set  x  to  obtain  the  symbol  string  s 

7:  Construct  the  (irreducible)  state  transition  probability  matrix  n  by  using 
Eqs.  (5)  and  (6) 

8:  Compute  the  (sum-normalized)  left  eigenvector  p  corresponding  to  the 
(unique)  unity  eigenvalue  of  II 


III.  Target  detection  and  Classification 

Two  classification  problems  are  addressed  in  this  paper,  both 
of  which  are  of  binary  type.  In  the  first  problem  of  target 
detection,  the  output  of  the  classifier  is  either  “no  target”  or 
“target  present”;  and  the  second  problem  is  about  classifying 
the  target  as  “human”  or  “animal”  led  by  a  human.  In  an  earlier 
publication  [9]  on  this  topic,  these  two  classifiers  are  corre¬ 
lated,  i.e.,  the  human\  animal  classification  is  performed  only 
after  a  target  is  detected,  regardless  of  whether  this  detection 
is  correct  or  false.  However,  the  current  paper  treats  these  two 
classification  problems  separately,  because  the  main  objective 
here  is  to  compare  the  performance  of  Cepstrum,  PCA  and 
SDF  as  feature  extractors  and  thus  treating  the  detection  and 
classification  problems  separately  would  yield  unambiguous 
comparison.  In  this  context,  three  different  classifiers  that 
have  been  used  in  conjunction  with  each  of  Cepstrum,  PCA 
and  SDF  feature  extractors  are:  (i)  support  vector  machines 
(SVM)  [13],  (ii)  k-nearest  neighbor  (k-NN)  [13],  and  (iii) 
sparse  representation  classifier  (SRC)  [11]  that  are  briefly 
reviewed  in  the  following  paragraphs. 
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A.  Support  vector  machine  (SVM)  and  k-nearest  neighbor  (k- 
NN)  algorithms 

The  SVM  and  k-NN  algorithms  are  among  the  most  fre¬ 
quently  used  tools  of  pattern  classification  [13].  For  /e-NN,  the 
neighborhood  size  k  is  preferably  kept  small  to  avoid  the  over- 
fitting  problem.  Similar  to  the  earlier  work  of  Jin  et  al.  [9], 
the  SVM  method  has  also  been  used  in  the  current  paper  to 
regenerate  the  results  and  compare  the  performance  with  other 
classification  algorithms.  A  Gaussian  kernel  has  been  used  for 
SVM,  similar  to  what  was  done  in  [11]  for  target  classification 
with  UGS  systems. 

B.  Sparse  representation  classification  (SRC)  algorithm 

The  SRC  algorithm  was  first  used  for  solving  the  prob¬ 
lem  of  face  recognition  [12],  where  a  large  matrix  A  = 
[A4A2  •  •  •  Ac],  consisting  of  the  training  data,  is  constructed 
where  Ai,i  £  { 1 ,  •  •  -  ,C}  is  a  (n  x  Ni)  training  matrix 
consisting  of  the  training  samples  belonging  to  the  ith  class,  n 
is  the  dimension  of  the  feature  vector,  and  Ni,  i  £  {1,  ■  ■  •  ,  C} 
is  the  number  of  training  samples  in  the  class  i,  and  C  is  the 
total  number  of  class  labels.  It  is  assumed  that  a  test  sample 
from  the  ith  class  lies  approximately  within  the  subspace 
formed  by  the  training  data  of  the  ith  class.  For  a  given  test 
vector  y  in  the  sparse  classification  algorithm,  the  following 
(■\  -optimization  problem  [19]  needs  to  be  solved  as: 

min  HxH^  such  that  ||y  —  Ax||^2  <  e 

In  the  above  optimization  problem,  the  user-selected  parameter 
e  is  a  representative  of  the  upper  bound  of  the  noise  spectrum 
in  the  data  [20],  where  the  optimal  solution  x  of  the  above 
t\  optimization  is  shown  to  be  a  sparse  vector.  For  x  £ 
Rm,  where  M  is  the  total  number  of  training  samples,  let 
Si(x)  £  be  a  new  vector  whose  only  non-zero  elements 
are  the  entries  in  x  that  are  associated  with  class  i  in  A. 
In  the  noiseless  scenario,  if  the  test  data  y  belongs  to  the 
ith  class,  then  Sj(x)  should  be  a  zero  vector  for  all  possible 
j  ^  i.  However,  since  noisy  data  are  encountered  in  real-life 
applications,  the  residuals  \\y— ASi(x)\\2  needs  to  be  computed 
for  classification  of  the  test  sample  y  and  the  label  of  the  test 
vector  y  is  predicted  as  the  argument  i*  that  is  the  minimizer 
of  the  residuals. 

One  of  the  main  advantages  of  the  SRC  algorithm  is  that 
a  careful  selection  of  the  features  is  not  necessary.  What 
is  critical,  however,  is  whether  the  number  of  features  is 
sufficiently  large  and  whether  the  sparse  representation  is 
correctly  computed.  The  SRC  algorithm  is  also  robust  in 
dealing  with  noisy  data  [20]. 

IV.  Results  and  Discussion 

This  section  presents  the  results  of  target  detection  and 
classification,  which  were  generated  from  two  sets  of  field 
data,  each  consisting  of  time  series  generated  from  passive 
infrared  (PIR)  and  seismic  sensors  in  different  sensor  sites. 
Each  of  the  test  scenarios  consists  either  of  the  two  classes 
of  targets:  (i)  human  walking  alone,  and  (ii)  animal  led  by  a 
walking  human.  These  targets  moved  along  an  approximately 
150  meters  long  trail  and  returned  along  the  same  trail  to  the 


starting  point;  all  targets  passed  by  the  sensor  sites  at  a  distance 
of  approximately  5  meters.  Signals  from  both  PIR  and  seismic 
sensors  were  acquired  at  a  sampling  frequency  of  10  kHz  and 
each  test  was  conducted  over  a  period  of  approximately  50 
seconds. 

The  field  data  set  #1  was  collected  on  three  different  days, 
Day#l,  Day#2  and  Day#3,  from  test  fields  on  a  wash  (i.e., 
the  dry  bed  of  an  intermittent  creek)  and  at  a  choke  point 
(i.e.,  a  place  where  the  targets  are  forced  to  go  due  to  terrain 
difficulties).  In  contrast,  the  field  data  set  #2  was  collected  on 
four  different  days,  Day#l,  Day#2,  Day#3  and  Day#4,  from 
test  fields  on  different  types  of  terrains  such  as  wash,  trail  and 
watering  stations.  Table  I  lists  the  numbers  of  different  labeled 
scenarios  for  each  day  for  both  data  sets  #1  and  #2. 

The  data  sets  #1  and  #2  have  been  analyzed  for  performance 
comparison  of  the  feature  extractors,  Cepstrum,  PCA,  and  SDF 
in  conjunction  with  the  (binary)  classifiers,  SVM,  k-NN,  and 
SRC  that  have  the  following  respective  design  parameters: 

•  SVM:  Variance  of  the  Gaussian  kernel  and  regularization 
parameter  [21],  [13]  ; 

•  k-NN:  Neighborhood  size  k  [13]; 

•  SRC:  Error  upper  bound  e  [11]. 

Figure  2  presents  typical  time  series  and  extracted  features 
from  PIR  and  seismic  sensor  signals,  where  individual  rows 
represent  different  scenarios  and  the  individual  columns  repre¬ 
sent  the  following:  sensor  signals  in  Column  1  along  with  the 
extracted  features  generated  by  Cepstrum  in  Column  2,  PCA 
in  Column  3,  and  SDF  in  Column  4. 

Let  JVi  be  the  total  number  of  test  samples  for  the  Class 
1  target  and  let  n±  (where  0  <  n\  <  N{)  be  the  number  of 
misclassifications  of  the  Class  1  target  as  belonging  to  Class 
2;  similarly,  let  N2  and  77-2  (where  0  <  712  <  N2)  be  the 
corresponding  parameters  for  the  Class  2  target.  The  goal  is 
to  select  the  design  parameter  of  a  classifier  by  making  a  trade¬ 
off  between  two  conflicting  objectives  of  minimizing  misclas¬ 
sifications  of  Class  1  and  Class  2  targets,  i.e.,  classifying  a 
target  to  one  class  when  it  truly  belongs  to  the  other  class. 
Let  I\  =  and  P2  —  be  the  estimated  probabilities  of 
misclassifying  Class  1  and  Class  2  targets,  respectively.  In  this 
context,  the  composite  objective  function  is  selected  to  be  a 
weighted  linear  combination  of  the  misclassification  rates. 

J(a)  =  a P\  +  (1  -  a)P2  (7) 

where  a  £  (0, 1)  is  the  (user-selected)  trade-off  parameter;  and 
J  (a)  is  the  cost  functional  to  be  minimized  with  respect  to  the 
classifier  design  parameter.  Depending  on  how  the  parameter  a 
is  selected,  minimization  of  J  (a)  may  lead  to  different  optimal 
solutions  for  Pi  and  P2 ■  Given  N\  and  N2  and  selecting  a  = 
(  Ai+V2)’ the  cost  functional  J  in  Eq.  (7)  becomes  ( 
that  is  the  total  misclassification  rate  of  combined  Class  1  and 
Class  2  targets. 

For  each  field  data  set,  parameters  of  the  feature  extraction 
algorithms  are  chosen  by  numerical  experiments  and  are  kept 
invariant  for  all  tests.  For  Cepstrum  method  (see  Algorithm  1 
in  Section  II),  Nc  and  Nf  are  set  to  100  and  150,  respec¬ 
tively.  Thus,  the  number  of  Cepstrum  features  is  100  for 
both  detection  and  classification.  For  the  PCA  method  (see 
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(b)  Typical  seismic  sensor  signals:  “Human  target”  (top  row)  and  “Animal  target  led  by  a  human”  (bottom  row) 

Fig.  2.  Sensor  signals  (Column  1)  along  with  the  extracted  features  generated  by  Cepstrum  (column  2),  PCA  (column  3)  and  SDF  (column  4) 


Algorithm  2  in  Section  II),  96%  of  the  total  variance  is  used 
as  the  cumulative  proportion  to  select  the  number  of  principal 
components  which,  in  turn,  is  the  number  of  PCA  features 
used  for  both  target  detection  and  classification.  For  SDF- 
based  feature  extraction  (see  Algorithm  3  and  Algorithm  4  in 
Section  II),  alphabet  size  |£|  =  20  has  been  used  for  detection 
and  |£|  =  30  for  classification.  In  the  training  phase,  for  a 
finite  set  of  test  samples  in  each  class,  the  design  parameter 
of  each  of  the  three  (binary)  classifiers,  SVM,  k-NN  and  SRC, 
is  determined  by  minimizing  the  total  misclassification  rate  as 
explained  above. 

A  three-way  cross-validation,  based  on  the  data  collected 
on  three  days  for  the  data  set  #1,  has  been  used  to  assess  the 
performance  of  the  three  feature  extractors,  Cepstrum,  PCA 
and  SDF,  for  target  detection  and  classification  by  using  time 
series  data  from  PIR  and  seismic  sensors.  The  data  sets  are 
divided  into  three  sets  by  date,  i.e.,  first  data  from  Day  1  and 
Day  2  are  used  for  training  and  the  classifiers  are  tested  on 
Day  3.  This  process  is  repeated  for  two  other  combinations:  (a) 
Day  3  and  Day  1  data  as  training  data  and  Day  2  as  test  data, 
and  (b)  Day  2  and  Day  3  data  as  training  data  and  Day  1  as  test 
data.  Thus,  three  different  combinations  of  training  and  testing 
have  been  performed  for  the  field  data  set  #1.  Similarly,  for  the 
data  set  #2,  a  four-way  cross-validation  has  been  used  based 
on  the  data  collected  on  four  days  to  assess  the  performance 
of  the  three  feature  extractors,  Cepstrum,  PCA,  and  SDF,  for 
target  detection  and  classification  from  both  PIR  and  seismic 
sensor  data.  Thus,  four  different  combinations  of  training  and 
testing  have  been  performed  for  the  field  data  set  #2. 


A.  Target  detection 

Table  II  shows  averaged  confusion  matrices  for  the  target 
detection  problem  for  each  of  the  three  feature  extractors, 
where  the  upper  part  pertains  to  data  set  #1  and  the  lower 
part  to  data  set  #2.  These  confusion  matrices  are  obtained  by 
applying  the  SRC  classifier  individually  on  PIR  and  seismic 
sensor  data  when,  in  the  cross  validation  process,  the  respec¬ 
tive  data  on  each  of  the  single  days  are  used  for  testing  and  the 
remaining  data  for  training;  the  results  of  each  test  scenario 
are  then  summed  up  to  form  the  respective  confusion  matrix 
in  Table  II.  Each  column  of  the  twelve  confusion  matrices 
(i.e.,  six  confusion  matrices  for  each  data  set)  represents 
the  instances  in  a  predicted  class  of  “No  Target”  or  ‘Target 
present,”  and  each  row  represents  the  instances  in  an  actual 
class  (i.e.,  the  ground  truth).  For  data  set  #1,  out  of  a  total  of 
220  scenarios  for  PIR  sensors,  Cepstrum  and  PCA  yield  196 
and  179  correct  decisions,  respectively,  while  SDF  makes  215 
correct  decisions;  the  corresponding  number  for  data  set  #2 
are:  out  of  a  total  of  420  scenarios  for  PIR  sensors,  Cepstrum 
and  PCA  yield  340  and  328  correct  decisions,  respectively, 
while  SDF  makes  415  correct  decisions.  The  results  are  similar 
for  seismic  sensors. 

Table  III  summarizes  the  average  rates  of  successful  target 
detection,  false  alarm,  and  wrong  detection  by  cross-validation 
for  both  data  sets  #1  and  #2,  when  either  Cepstrum,  PCA 
or  SDF  is  used  as  the  feature  extractor  for  target  detection 
along  with  the  three  different  classifiers:  SVM,  k-NN  and 
SRC.  In  terms  of  wrong  detection,  the  performance  of  SDF  is 
significantly  superior  to  that  of  both  Cepstrum  and  PCA  for  all 
three  classifiers.  It  is  seen  that,  for  data  set  #1,  SDF  has  yielded 
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higher  successful  target  detection  rates  than  Cepstrum  and 
PCA  with  consistently  small  false  alarm  rates.  It  is  also  seen 
that,  on  the  average,  SDF  yields  better  results  than  Cepstrum 
and  PCA  for  data  set  #2. 

B.  Target  classification 

Table  IV  shows  averaged  confusion  matrices  with  the  SRC 
classifier  for  classification  of  human  versus  animal  led  by 
human,  using  PIR  and  seismic  sensors,  respectively,  for  each 
of  the  three  feature  extractors,  where  the  upper  part  pertains 
to  data  set  #1  and  the  lower  part  to  data  set  #2.  The  re¬ 
sults  are  obtained  by  applying  SRC  individually  on  PIR  and 
seismic  sensor  data  when,  in  the  cross  validation  process, 
the  respective  data  on  each  of  the  single  days  are  used  for 
testing  and  the  remaining  data  for  training.  Each  column  of 
the  twelve  confusion  matrices  (i.e.,  six  confusion  matrices  for 
each  data  set)  represents  the  instances  in  a  predicted  class  of 
“Human”  or  “Animal,”  and  each  row  represents  the  instances 
in  an  actual  class  (i.e.,  the  ground  truth).  For  the  data  set  #1, 
out  of  a  total  of  110  scenarios  for  seismic  sensors,  Cepstrum 
and  PCA  yield  76  and  62  correct  classifications,  respectively, 
while  SDF  makes  84  correct  classifications;  the  corresponding 
number  for  data  set  #2  are:  out  of  a  total  of  220  scenarios  for 
seismic  sensors,  Cepstrum  and  PCA  yield  119  and  109  correct 
classifications,  respectively,  while  SDF  makes  158  correct 
classifications.  The  results  are  qualitatively  similar  for  PIR 
sensors. 

Table  V  summarizes  the  average  rates  of  successful  human 
classification,  false  alarm,  and  misclassification  by  cross- 
validation  for  both  data  sets  #1  and  #2,  when  either  Cepstrum, 
PCA  or  SDF  is  used  as  the  feature  extractor  for  target  detection 
along  with  the  three  different  classifiers:  SVM,  k-NN  and 
SRC.  In  terms  of  misclassification  in  each  of  the  data  sets 
#1  and  #2,  the  performance  of  SDF  is  significantly  superior 
to  that  of  both  Cepstrum  and  PCA  for  all  three  classifiers. 

It  is  also  observed  that,  in  most  of  the  scenarios,  usage 
of  PIR  sensors  resulted  in  better  classification  performance 
compared  to  that  of  seismic  sensors  for  both  detection  and 
classification.  Better  results  are  expected  if  fusion  techniques 
are  employed  to  take  advantage  of  the  cross-correlated  infor¬ 
mation  of  the  sensors,  which  is  a  topic  for  future  research. 

C.  Computational  costs 

This  subsection  presents  a  comparison  of  the  computational 
costs  (i.e.,  execution  time)  of  the  three  feature  extraction 
algorithms  under  consideration.  To  this  end,  typical  simulation 
costs  have  been  generated  based  on  a  data  set  consisting  of 
156  sets  of  time  series  as  training  samples  and  64  different  sets 
of  time  series  as  test  samples.  The  results  have  been  generated 
on  a  single  core  of  a  2.13  Ghz  CPU  with  6  GB  memory.  It  is 
noted  that  each  of  PCA  and  SDF  has  a  training  phase  which 
is  usually  performed  off-line,  while  Cepstrum  treats  training 
and  test  samples  uniformly. 

In  the  experiments,  the  Cepstrum  features  were  extracted 
in  ~  16  milliseconds  for  each  time  series.  The  training  phase 
for  PCA,  which  includes  finding  the  projection  matrix  and 
extracting  the  features  for  the  training  samples,  took  ~  1.8 


seconds  while  the  feature  for  each  test  sample  was  generated  in 
less  than  10  microseconds.  Finally,  The  training  phase  for  SDF, 
which  includes  partitioning  of  the  data  set  and  extraction  of 
features  for  the  training  samples,  took  ~  2.4  seconds  while  the 
feature  of  a  test  sample  was  generated  in  ~  37  milliseconds. 
Therefore,  for  real-time  applications,  the  operations  of  both 
SDF  and  Cepstrum  are  performed  in  the  order  of  milliseconds 
to  extract  features  from  test  time  series  with  a  slight  advantage 
toward  Cepstrum,  and  PCA  is  the  fastest  because  it  only 
accounts  for  a  linear  projection  for  extracting  the  features. 

V.  Summary,  Conclusions,  and  Future  Work 

This  paper  presents  a  comparative  evaluation  of  Cepstrum, 
principal  component  analysis  (PCA)  and  symbolic  dynamic 
filtering  (SDF)  as  feature  extractors  for  target  detection  and 
classification.  All  three  methods  have  been  tested  on  two  data 
sets,  consisting  of  passive  infrared  (PIR)  and  seismic  sensors, 
collected  from  different  fields.  The  underlying  algorithms  of 
feature  extraction  have  been  executed  in  conjunction  with 
three  different  classification  algorithms,  namely,  support  vector 
machines  (SVM),  k-nearest  neighbor  (k-NN),  and  sparse  rep¬ 
resentation  classifier  (SRC).  Cross-validation  has  been  used 
to  assess  the  performance  of  Cepstrum,  PCA  and  SDF  as 
feature  extractors  for  both  PIR  and  seismic  sensor  data  sets. 
The  results  show  consistently  superior  performance  of  SDF- 
based  feature  extraction  over  both  Cepstrum-based  and  PCA- 
based  feature  extraction  in  terms  of  successful  detection,  false 
alarm,  and  wrong  detection  and  classification  decisions.  The 
rationale  for  superior  performance  of  SDF  over  Cepstrum  and 
PCA  as  a  feature  extractor  is  presented  below. 

Cepstrum  features  are  extracted  using  only  the  local  in¬ 
formation  of  a  given  time  series  without  making  use  of  the 
training  data.  In  contrast,  PCA  utilizes  the  training  data  to 
find  the  linear  transformation  and  approximates  the  subspace 
in  which  most  of  the  variance  of  the  training  data  lies. 
Although  this  is  an  advantage  of  PCA  over  Cepstrum,  a  linear 
transformation  is  not  necessarily  a  good  method  of  feature 
extraction  from  the  data;  this  might  indeed  be  a  possible  reason 
of  degraded  classification  performance  of  PCA.  On  the  other 
hand,  SDF  is  a  nonlinear  feature  extraction  algorithm  that 
extracts  the  pertinent  information  as  a  feature  vector  and  makes 
use  of  the  full  training  data  to  find  the  appropriate  partitioning 
of  the  test  time  series. 

Topics  for  future  research  in  this  area  include  further 
theoretical  study  on  refinement  of  the  SDF  algorithms  and 
field  testing  under  different  environments.  In  this  regard,  a 
few  key  topics  are  delineated  below. 

1)  Investigation  of  the  impact  of  finite  length  of  training 
and  test  data  sets  on  classification  performance  [22]  [12]. 

2)  Comparison  with  additional  methods  of  feature  extrac¬ 
tion  (i.e.,  besides  Cepstrum  and  PCA)  in  conjunction 
with  other  classification  algorithms  (i.e.,  besides  SVM, 
k-NN  and  SRC). 

3)  Development  of  a  rigorous  field  test  procedure  for  vali¬ 
dating  robustness  of  SDF-based  feature  extraction  under 
different  environmental  conditions  and  data  types. 
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4)  Performance  evaluation  of  SDF  as  a  feature  extractor  for 
classification  of  different  types  of  human  motion,  such 
as  walking  and  running. 

5)  Fusion  of  different  types  of  sensors  to  extract 
cross-correlated  information  between  the  informational 
sources  to  improve  the  classification  performance. 

References 

[1]  J.  Altmann,  “Acoustic  and  seismic  signals  of  heavy  military  vehicles  for 
co-operative  verification,”  Proceedings  of  the  IEEE ,  vol.  273,  no.  4-5, 
p.  713740,  2004. 

[2]  Y.  Tian  and  H.  Qi,  “Target  detection  and  classification  using  seismic 
signal  processing  in  unattended  ground  sensor  systems,”  in  International 
Conference  on  Acoustics  Speech  and  Signal  Processing  (ICASSP),  13-17 
May  2002. 

[3]  D.  Li,  K.  D.  Wong,  Y.  H.  Hu,  and  A.  M.  Sayeed,  “Detection,  clas¬ 
sification,  and  tracking  of  targets,”  IEEE  Signal  Processing  Magazine , 
vol.  19,  no.  2,  pp.  17-29,  2002. 

[4]  G.  Succi,  D.  Clapp,  R.  Gampert,  and  G.  Prado,  “Footstep  detection  and 
tracking,”  Unattended  Ground  Sensor  Technologies  and  Applications  III, 
vol.  4393,  no.  1,  pp.  22-29,  2001. 

[5]  G.  Mallapragada,  A.  Ray,  and  X.  Jin,  “Symbolic  dynamic  filtering  and 
language  measure  for  behavior  identification  of  mobile  robots,”  IEEE 
Transactions  on  Systems,  Man,  and  Cybernetics,  Part  B ,  vol.  42,  no.  3, 
pp.  647-659,  2012. 

[6]  A.  Ray,  “Symbolic  dynamic  analysis  of  complex  systems  for  anomaly 
detection,”  Signal  Processing,  vol.  84,  pp.  1115-1130,  July  2004. 

[7]  V.  Rajagopalan  and  A.  Ray,  “Symbolic  time  series  analysis  via  wavelet- 
based  partitioning,”  Signal  Processing,  vol.  86,  no.  11,  pp.  3309-3320, 
2006. 

[8]  A.  Subbu  and  A.  Ray,  “Space  partitioning  via  Hilbert  transform  for  sym¬ 
bolic  time  series  analysis,”  Applied  Physics  Letters,  vol.  92,  pp.  084107- 
1,  February  2008. 

[9]  X.  Jin,  S.  Sarkar,  A.  Ray,  S.  Gupta,  and  T.  Damarla,  “Target  detection 
and  classication  using  seismic  and  PIR  sensors,”  IEEE  Sensors  Journal, 
vol.  12,  no.  6,  pp.  1709-1718,  2012. 


[10]  D.  Childers,  D.  Skinner,  and  R.  Kemerait,  “The  cepstrum:  A  guide  to 
processing,”  Proceedings  of  the  IEEE,  vol.  65,  no.  10,  pp.  1428-1443, 
1977. 

[11]  N.  Nguyen,  N.  Nasrabadi,  and  T.  Tran,  “Robust  multi-sensor  classi¬ 
fication  via  joint  sparse  representation,”  in  Proceedings  of  the  14th 
International  Conference  on  Information  Fusion  (FUSION),  pp.  1-8, 
5-8  July  2011. 

[12]  J.  Wright,  A.  Yang,  A.  Ganesh,  S.  Sastry,  and  Y.  Ma,  “Robust  face 
recognition  via  sparse  representation,”  The  IEEE  Transactions  on  Pattern 
Analysis  and  Machine  Intelligence,  vol.  31,  no.  2,  pp.  210-227,  2008. 

[13]  C.  M.  Bishop,  Pattern  Recognition  and  Machine  Learning.  Springer, 
New  York,  NY,  USA,  2006. 

[14]  Z.  Zhang,  X.  Gao,  J.  Biswas,  and  K.  Wu,  “Moving  targets  detection  and 
localization  in  passive  Infrared  sensor  networks,”  in  Proceedings  of  the 
10th  International  Conference  on  Information  Fusion,  9-12  July  2007. 

[15]  R.  A.  Gramann,  M.  Bennett,  and  T.  D.  Obrien,  “Vehicle  and  personnel 
detection  using  seismic  sensors,”  Sensors,  C3I,  Information,  and  Train¬ 
ing  Technologies  for  Law  Enforcement,  vol.  3577,  no.  1,  p.  7485,  1999. 

[16]  A.  Oppenheim  and  R.  Shafer,  Digital  Signal  Processing.  Prentice  Hall, 
NJ,  USA,  1975. 

[17]  P.  Adenis,  K.  Mukherjee,  and  A.  Ray,  “State  splitting  and  state  merging 
in  probabilistic  finite  state  automata,”  in  Proceedings  of  2011  American 
Control  Conference,  pp.  5145-5150,  2011. 

[18]  A.  Berman  and  R.  Plemmons,  Nonnegative  Matrices  in  the  Mathematical 
Sciences.  SIAM,  Philadelphia,  PA,  USA,  1994. 

[19]  D.  Donoho,  “Compressed  sensing,”  IEEE  Transactions  on  Information 
Theory,  vol.  52,  pp.  1289-1306,  April  2006. 

[20]  E.  Candes  and  T.  Tao,  “Near-optimal  signal  recovery  from  random 
projections:  Universal  encoding  strategies?,”  IEEE  Transactions  on 
Information  Theory,  vol.  52,  pp.  5406-5425,  December  2006. 

[21]  B.  Scholkpf  and  A.  Smola,  Learning  with  kernels  Support  vector  ma¬ 
chines,  regularization,  optimization,  and  beyond.  MIT  Press,  Cambridge, 
MA,  USA,  2002. 

[22]  Y.  Wen,  K.  Mukherjee,  and  A.  Ray,  “Adaptive  pattern  classification  for 
symbolic  dynamic  systems,”  Signal  Processing,  vol.  93,  no.  1,  pp.  252- 
260,  2013. 


TABLE  I 

Number  of  different  scenarios  in  each  day  for  the  data  sets  #1  and  #2 


Data  set  #1  Data  set  #2 


Target 

Dayl 

Day2 

Day3 

Total 

Dayl 

Day2 

Day3 

Day4 

Total 

No  target 

32 

28 

50 

110 

6 

56 

82 

66 

210 

Human 

14 

22 

30 

66 

6 

21 

47 

32 

106 

Animal 

18 

6 

20 

44 

0 

35 

35 

34 

104 

TABLE  II 

Confusion  matrices  obtained  by  the  SRC  classifier  using  PIR  and  seismic  sensors  of  data  sets  #1  and  #2  for  target  detection. 
Each  confusion  matrix  is  obtained  by  summing  up  the  confusion  matrices  that  are  generated  by  cross  validation. 

PIR  Seismic 


No  Target  Target  Present  No  Target  Target  Present 


,,  .  No  Target 

s  Cepstrum  Target 

91  19 

5  105 

104  6 

11  99 

«  PCA  No  Target 

n  Target  Present 

95  15 

26  84 

70  40 

39  71 

SDF  No  Tar§et 

Target  Present 

108  2 

3  107 

107  3 

9  101 

No  Target 

s  cepstrum  Target  PrSesent 

199  11 

69  141 

179  31 

54  156 

«  PCA  No  Target 

n  Target  Present 

153  57 

35  175 

43  167 

28  182 

SDF  N°  Target 

Target  Present 

207  3 

2  208 

198  12 

28  182 
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TABLE  III 

Results  of  three-way  cross-validation  for  target  detection  using  PIR  and  seismic  sensors  of  data  set  #1 


Successful  Target  Detection  False  Alarm  Wrong  Decision 


Cepstrum 

PCA 

SDF 

Cepstrum 

PCA 

SDF 

Cepstrum 

PCA 

SDF 

SVM 

0.97 

0.95 

0.97 

0.05 

0.00 

0.00 

3.64% 

2.27% 

1.36% 

PIR 

k-NN 

0.95 

0.90 

0.97 

0.00 

0.00 

0.00 

2.72% 

5.00% 

1.36% 

aS 

1 75 

SRC 

0.95 

0.76 

0.97 

0.17 

0.14 

0.02 

10.91% 

18.64% 

2.27% 

08 

« 

SVM 

0.87 

0.60 

0.94 

0.14 

0.14 

0.09 

13.18% 

27.27% 

7.73% 

Q 

Seismic 

k-NN 

0.78 

0.15 

0.94 

0.25 

0.05 

0.08 

23.64% 

45.00% 

7.27% 

SRC 

0.90 

0.65 

0.92 

0.05 

0.36 

0.03 

7.72% 

35.91% 

5.45% 

SVM 

1.00 

1.00 

0.99 

0.02 

0.00 

0.01 

1.43% 

0.23% 

1.43% 

s? 

PIR 

k-NN 

1.00 

0.88 

1.00 

0.01 

0.01 

0.01 

0.24% 

6.43% 

0.71% 

a$ 

1 75 

SRC 

0.67 

0.83 

0.99 

0.05 

0.27 

0.01 

19.05% 

21.90% 

1 . 19% 

a 

« 

SVM 

0.88 

0.91 

0.90 

0.16 

0.08 

0.07 

14.05% 

8.33% 

8.57% 

Q 

Seismic 

k-NN 

0.89 

0.59 

0.88 

0.10 

0.01 

0.08 

10.71% 

20.95% 

10.00% 

SRC 

0.74 

0.83 

0.87 

0.15 

0.73 

0.05 

20.24% 

45.00% 

9.29% 
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TABLE  IV 

Confusion  matrices  obtained  by  SRC  classifier  for  PIR  and  seismic  sensors  of  data  sets  #1  and  #2  for  human/animal 

CLASSIFICATION. 


PIR 


Seismic 


Human  Animal 


Human  Animal 


„  .  Human 

Cepstrum  .  .  , 

1  Animal 


51 

25 


15 

19 


56 

24 


10 

20 


l  pca  i'umai: 

tS  Animal 

61  5 

21  23 

49  17 

31  13 

Human 
SDF  Animal 

57  9 

16  28 

55  11 

15  29 

g  Cepstrum  Human 
^  Animal 

64  42 

52  52 

71  35 

56  48 

l  pca  i'u,mi; 

a  Animal 

59  47 

33  71 

16  90 

11  93 

Human 
SDF  Animal 

83  23 

54  50 

91  15 

37  67 
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TABLE  V 

Results  of  three-way  and  four-way  cross-validation  for  human/animal  classification  of  data  set  #1  and  #2,  respectively. 


Successful  Human  Classification  Human  False  Alarm  Wrong  Decision 


Cepstrum 

PCA 

SDF 

Cepstrum 

PCA 

SDF 

Cepstrum 

PCA 

SDF 

SVM 

0.73 

0.92 

0.88 

0.45 

0.43 

0.20 

34.55% 

21.82% 

15.45% 

PIR 

k-NN 

0.88 

0.91 

0.89 

0.41 

0.52 

0.34 

23.64% 

26.36% 

20.00% 

CJ 

VI 

SRC 

0.77 

0.92 

0.86 

0.57 

0.48 

0.36 

36.36% 

23.64% 

22.73% 

ct 

as 

SVM 

0.68 

0.24 

0.85 

0.59 

0.45 

0.41 

41.82% 

63.64% 

25.45% 

a 

Seismic 

k-NN 

0.70 

0.55 

0.86 

0.86 

0.68 

0.50 

52.73% 

54.55% 

28.18% 

SRC 

0.85 

0.74 

0.83 

0.55 

0.70 

0.34 

30.91% 

43.64% 

23.64% 

SVM 

0.76 

0.61 

0.89 

0.42 

0.58 

0.38 

32.86% 

48.09% 

24.76% 

PIR 

k-NN 

0.73 

0.59 

0.76 

0.72 

0.46 

0.52 

49.05% 

43.33% 

37.62% 

"5 

SRC 

0.60 

0.56 

0.78 

0.50 

0.32 

0.52 

44.76% 

38.10% 

36.67% 

03 

W 

SVM 

0.63 

0.92 

0.91 

0.38 

0.53 

0.38 

37.62% 

30.48% 

23.81% 

o 

Seismic 

k-NN 

0.56 

0.97 

0.77 

0.36 

0.69 

0.41 

40.00% 

35.71% 

31.90% 

SRC 

0.67 

0.15 

0.86 

0.54 

0.10 

0.36 

43.33% 

48.10% 

24.76% 
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